Abstract

Drought is a major environmental stress limiting growth of perennial grasses in temperate regions. Plant drought toler- 
ance is a complex trait that is controlled by multiple genes. Candidate gene association mapping provides a powerful 
tool for dissection of complex traits. Candidate gene association mapping of drought tolerance traits was conducted 
in 192 diverse perennial ryegrass (Lolium perenne L.) accessions from 43 countries. The panel showed significant 
variations in leaf wilting, leaf water content, canopy and air temperature difference, and chlorophyll fluorescence 
under well-watered and drought conditions across six environments. Analysis of 109 simple sequence repeat mark- 
ers revealed five population structures in the mapping panel. A total of 2520 expression-based sequence readings 
were obtained for a set of candidate genes involved in antioxidant metabolism, dehydration, water movement across 
membranes, and signal transduction, from which 346 single nucleotide polymorphisms were identified. Significant 
associations were identified between a putative LpLEA3 encoding late embryogenesis abundant group 3 protein and
a putative LpFeSOD encoding iron superoxide dismutase and leaf water content, as well as between a putative LpCyt
Cu-ZnSOD encoding cytosolic copper-zinc superoxide dismutase and chlorophyll fluorescence under drought conditions.
Four of these identified significantly associated single nucleotide polymorphisms from these three genes were
also translated to amino acid substitutions in different genotypes. These results indicate that allelic variation in these 
genes may affect whole-plant response to drought stress in perennial ryegrass. 

Key words: Antioxidant, association mapping, Cyt Cu-ZnSOD, drought, FeSOD, LEA3, leaf water content, Lolium perenne. 



Introduction 

Perennial ryegrass (Lolium perenne L.) is a cool-season peren- 
nial grass species from the family Poaceae. Native to Europe, 
Asia, and northern Africa, this species is widely cultivated in
the temperate regions of the world as a high-quality forage
and for landscaping, as well as for other elements of ecosystem
services such as carbon sequestration, soil formation
and protection, and nutrient cycling. Perennial ryegrass is
a self-incompatible diploid (2n = 2x = 14) outcrossing species
(Cornish et al., 1980). Among the major perennial grass spe- 
cies, perennial ryegrass is a good model for studying asso- 
ciation of gene and drought tolerance because of its diploid 
genetics, available genetic and genomics resources, rapid stress 
responses, and available germplasm collections. Knowledge 
gained from studying this species will facilitate further inves- 
tigation in other perennial grass species with more complex, 
polyploid genomes. 

Drought stress is an important factor affecting growth 
and development of cool-season perennial grasses. Plant 
drought tolerance is a complex quantitative trait, involving 
multiple metabolic pathways. A number of genes involved in 
plant drought responses and tolerance have been identified 
in model plants and crops (Seki et al., 2002; Rabello et al., 
2008; Guo et al., 2009; Khowaja et al., 2009; Campo et al.,
2012). In rice (Oryza sativa L.) and Arabidopsis thaliana, two
groups of genes have been reported to be involved in drought 
responses: genes encoding functional or structural proteins 
and genes encoding regulatory proteins (Rabbani et al., 2003;
Shinozaki and Yamaguchi-Shinozaki, 2007). Genes in anti- 
oxidative metabolic pathways are among the key groups of 
functional genes, because these antioxidant pathways play 
an important role in detoxifying reactive oxygen species that 
can accumulate under drought or other abiotic stress conditions.
In perennial ryegrass, ~10% of differentially expressed
genes were associated with functions of detoxification (Liu 
and Jiang, 2010). Transformation of pea (Pisum sativum L.) 
PsMnSOD encoding manganese superoxide dismutase to rice,
and overexpression of Arabidopsis AtAPX encoding cytosolic
ascorbate peroxidase and AtMDAR encoding monodehydroascorbate
reductase in tobacco (Nicotiana tabacum L.)
improved drought tolerance of the targeted plants (Badawi
et al., 2004; Wang et al., 2005; Eltayeb et al., 2007). Other
functional or regulatory proteins also promote drought toler- 
ance, including late embryogenesis abundant (LEA) proteins 
(Duan and Cai, 2012) functioning in maintenance of protein 
and membrane structure (Bartels and Sunkar, 2005), and the 
aquaporins plasma membrane intrinsic proteins (PIPs) and 
tonoplast intrinsic proteins (TIPs) that facilitate the diffusion 
of water and uncharged solutes across membranes (Maurel 
et al., 2008). Overexpression of OsPIP1 genes in drought-sensitive
lowland rice enhanced its drought avoidance by
increased root hydraulic conductivity, and higher leaf water 
potential and relative cumulative transpiration (Lian et al., 
2004). Transgenic tobacco plants overexpressing BnPIP1
showed improved tolerance to drought stress, maintaining
normal leaf rigidity after 10 d of water deprivation,
while the wild type already showed severe leaf wilting
(Yu et al., 2005). These findings suggest that expression of
candidate genes encoding functional or regulatory proteins 
may enhance drought tolerance in plants. At the population 
level, the relationship between these genes and drought toler- 
ance in a diverse population panel of a plant species is largely 
unknown; specifically, whether allelic diversity at these genes 
underlies phenotypic variation associated with whole-plant 
tolerance to drought stress. 



Linkage disequilibrium (LD)-based association mapping is 
a powerful tool for genetically dissecting complex traits controlled
by multiple quantitative trait loci (QTLs) (Yu et al.,
2006a; Zhao et al., 2007; Harjes et al., 2008; Tian et al., 2009;
Emanuelli et al., 2010; Li et al., 2011; Skøt et al., 2011; Cook
et al., 2012). Through exploitation of historical recombination
events at a population level, association studies identify
marker-trait associations and test large numbers of alleles in 
a diverse population (Yu and Buckler, 2006). For a species 
in which LD decays rapidly, candidate gene association map- 
ping is appropriate to relate sequence variations in selected 
candidate genes to specific traits of interest (Zhu et al., 2008).
Rapid LD decay within 300-2000 bp has been detected in perennial
ryegrass, depending on the population size and individual
genes (Skøt et al., 2007; Xing et al., 2007; Brazauskas
et al., 2011; Fiil et al., 2011). A candidate gene, FLOWERING
LOCUS T, has been found to be associated with changes in
flowering time across a range of populations of perennial
ryegrass (Skøt et al., 2011). In maize (Zea mays L.), candidate
gene association mapping identified single nucleotide polymorphisms
(SNPs) in genes that affect abscisic acid levels in
floral tissue during drought stress (Setter et al., 2011). These
findings indicate that the candidate gene association mapping 
approach can be effectively used to establish associations 
between targeted genes with known function and complex 
traits in outcrossing species. 

The objective of this study was to conduct candidate 
gene association mapping of drought tolerance in perennial 
ryegrass. It was hypothesized that diverse perennial ryegrass 
varied greatly in whole-plant physiological traits related to 
drought tolerance. Given the evidence that genes involved 
in antioxidant pathways and in other functional or regula- 
tory systems may play a role in drought tolerance, it was 
also hypothesized that these candidate genes were signifi- 
cantly associated with drought tolerance traits in perennial 
ryegrass. In this study, whole-plant physiological responses of 
a global collection of 192 diverse perennial ryegrass acces- 
sions to drought stress were examined across six environ- 
ments. Population structure and relative pairwise kinship of 
the population were determined by using a set of 109 simple 
sequence repeat (SSR) markers. Gene-trait associations were 
analysed after controlling for population structure.



Materials and methods 

Plant materials 

A global collection of 186 perennial ryegrass (L. perenne L.) acces- 
sions was obtained from the USDA National Plant Germplasm 
System at the Western Regional Plant Introduction Station in 
Pullman, WA, USA, and six turf-type commercial cultivars were 
obtained from the seed industry (Turf-Seed Company, Gervais, OR, 
USA; Scotts Inc., Marysville, OH, USA). This collection was based 
on geographical locations of accessions to maximize ecotype diversity
(Supplementary Table S1 available at JXB online). The panel
included 72 wild, 66 cultivated, and 54 accessions with uncertain 
pedigree according to germplasm bank classification. All the accessions
were confirmed as diploid by flow cytometry (Y. Wang et al.,
2009). A single seed from each accession was initially sown in small
plastic pots containing a sandy loam soil with a pH of 6.9. Each
accession was then propagated multiple times by tillers to maintain
genetic uniformity. The grasses were planted in the field in Wanatah
(41°26'N and 86°54'W), West Lafayette (40°25'N and 86°54'W), and
Vincennes (38°40'N and 87°31'W), Indiana, USA, in August 2008.
Soil type was sandy loam in Wanatah and Vincennes,
and silt loam in West Lafayette. Each location had three replica- 
tions for each accession with the same genotypes across locations. 
The maintenance of plants in the field followed the routine practices 
of perennial ryegrass during the experiment. A total of 179 acces- 
sions were alive after winter 2008 and were used for phenotypic data 
collection. Drought stress was imposed in the plots by withholding 
water for 7-10 d after initial measurements were taken under well-watered
conditions in May at each location in 2009 and 2010. Averaged
across the 2 years, soil water content was 15.0% under well-watered
and 12.1% under drought conditions, and mean air temperature was
20.6 °C and 23.1 °C, respectively. Detailed information on
sampling dates, soil water, and air temperatures at all three locations
is shown in Supplementary Table S2. All accessions were under
drought stress for the same amount of time. The typical climatic pat- 
tern at the field site included multiple dry spells each lasting 7-15 d 
during the growing season, which caused light to severe leaf wilting 
in the perennial ryegrass accessions. 



Phenotypic traits 

Data were collected before and after drought stress at three locations
in 2009 and 2010 for the following traits: canopy and air temperature
difference (CAD), leaf water content (LWC), chlorophyll
fluorescence (Fv/Fm), and leaf wilting. CAD was measured using
a hand-held infrared thermometer (AGRI-THERM III, Everest
Interscience Inc., Tucson, AZ, USA). LWC was measured according
to the following equation: LWC (g g⁻¹) = (FW − DW)/DW, where FW
is the leaf fresh weight and DW is the dry weight. FW was measured
immediately after leaves were cut in the field. The leaves were
then dried in an oven at 75 °C for 72 h to obtain the DW. Fv/Fm
was measured on 4-5 arbitrarily selected leaves for each accession at
2200 h using a fluorescence meter (OS-30P, OPTI-Sciences, Hudson,
NH, USA). This timing allowed a minimum of 45 min of leaf dark
adaptation prior to sampling, which was sufficient for photoinhibited
leaves to recover their fluorescence ability (Demmig and Björkman,
1987). Leaf wilting was assessed by visual rating using a scale of 1
(slight wilting) to 9 (severe wilting). CAD, LWC, and Fv/Fm were
measured under both well-watered and drought-stressed conditions,
while leaf wilting was only assessed under drought stress. Endophyte
Neotyphodium lolii infection in each plant accession was determined 
using a tissue print immunoblot procedure (Agrinostics Ltd. Co., 
Watkinsville, GA, USA). 
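
The LWC calculation can be illustrated with a minimal Python sketch. The function name and the example weights below are hypothetical and chosen only for illustration; they are not measurements from this study.

```python
def leaf_water_content(fresh_weight_g: float, dry_weight_g: float) -> float:
    """Return LWC in g water per g dry weight: (FW - DW) / DW."""
    if dry_weight_g <= 0:
        raise ValueError("dry weight must be positive")
    if fresh_weight_g < dry_weight_g:
        raise ValueError("fresh weight cannot be below dry weight")
    return (fresh_weight_g - dry_weight_g) / dry_weight_g


# Hypothetical leaf samples (weights in grams), for illustration only.
well_watered = leaf_water_content(0.518, 0.100)
drought = leaf_water_content(0.357, 0.100)
print(round(well_watered, 2), round(drought, 2))
```

Because the denominator is dry weight rather than fresh weight, LWC values above 1 g g⁻¹ are expected for hydrated leaves.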

To examine the consistency and accuracy of phenotypic data,
repeatability of phenotypic traits across six environments (2 years × 3
locations), as reflected by heritability ($h^2$), was calculated using PROC
MIXED (SAS Institute, Version 9.1, Cary, NC, USA). The $h^2$ was
calculated as follows: $h^2 = \sigma_g^2 / (\sigma_g^2 + \sigma_{ge}^2/e + \sigma_e^2/re)$, where $\sigma_g^2$, $\sigma_e^2$, and
$\sigma_{ge}^2$ represent Type III SS (sums of squares) for genotype (G), environment
(E), and G×E, respectively. The 'e' is the degree of freedom of
E and 're' is the degree of freedom of G×E. Least squares means were
calculated based on the pooled data from six environments × three
replications since $h^2$ was >0.5 for all traits (Supplementary Table S3
at JXB online). Analysis of variance (ANOVA) was calculated using
SAS PROC MIXED with both accessions and drought stress as fixed
effects, and environments and G×E as random factors.
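
As a rough illustration of the repeatability calculation, the sketch below assumes the expression takes the common form h² = σ²g / (σ²g + σ²ge/e + σ²e/re). That form, the function name, and the input values are assumptions made for illustration; the actual estimates in this study came from SAS.

```python
def repeatability(var_g: float, var_ge: float, var_e: float,
                  e: float, re: float) -> float:
    """Ratio of the genotype component to the assumed phenotypic total.

    Assumed form: var_g / (var_g + var_ge / e + var_e / re).
    """
    if e <= 0 or re <= 0:
        raise ValueError("divisors e and re must be positive")
    total = var_g + var_ge / e + var_e / re
    if total <= 0:
        raise ValueError("total variance must be positive")
    return var_g / total


# Hypothetical components, not values from this study.
h2 = repeatability(var_g=2.0, var_ge=1.0, var_e=3.0, e=6, re=18)
print(round(h2, 3))
```

Larger divisors shrink the non-genotypic terms, which is why pooling over more environments and replications raises the estimate.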



Molecular profiling 

Young leaves from each accession were collected, frozen in liquid 
N, and stored at -80 °C for DNA isolation using a cetyltrimethyl
ammonium bromide (CTAB) method (Murray and Thompson,
1980). DNA concentration was quantified using a NanoDrop
ND-1000 spectrophotometer (NanoDrop Technologies Inc.,
Rockland, DE, USA) and diluted to 15 ng µl⁻¹ for polymerase chain
reaction (PCR). A total of 109 published genome-wide SSR markers
(Kubik et al., 2001; Jensen et al., 2005; Lauvergeat et al., 2005; Gill
et al., 2006; King et al., 2008) across seven chromosomes in perennial
ryegrass were screened in all accessions (Supplementary Table
S4 at JXB online). The forward primer sequence was modified by
adding an M13 tail (5'-ACG ACG TTG TAA AAC GAC) to the 5' 
end, and the M13 primers were labelled with four fluorescent dyes of 
different colours [FAM (blue), VIC (green), PET (yellow), and NED
(red)] for multiplex analysis in ABI 3730 (Applied Biosystems Inc.,
Foster City, CA, USA). Each 10 µl PCR reaction consisted of 1×
PCR buffer, 2.5 mM MgCl₂, 0.2 mM dNTP mix, 0.05 µM forward
tailed primer, 0.1 µM reverse primer, 0.05 µM fluorescent-labelled
M13 primer, 1.0 U of Taq DNA polymerase, and 60 ng of DNA. All
PCRs were performed in a 384-well iCycler thermocycler (Bio-Rad
Inc., Hercules, CA, USA) using a touch-down program (Yu et al.,
2006b). In brief, PCR was started at 95 °C for 5 min, then ran for
four cycles at 96 °C for 1 min, 68 °C for 5 min, with a decrease of 2 °C
in each subsequent cycle, and 72 °C for 1 min; four cycles of 96 °C
for 1 min, 58 °C for 2 min, with a decrease of 2 °C in each subsequent
cycle, and 72 °C for 1 min; followed by 24 cycles of 96 °C for 1 min,
50 °C for 1 min, 72 °C for 1 min, with a final extension of 72 °C for
5 min and 4 °C for 5 min. The amplified fragments were separated
in an ABI 3730 DNA Sequencer (Applied Biosystems Inc.). Alleles
were called using GeneMarker 1.6 software (SoftGenetics, LLC,
State College, PA, USA) and manually checked twice for accuracy.
Alleles differing in size by at least 2 bp were considered polymorphic
between accessions. If a primer amplified more than one fragment
in an accession and these fragments appeared in different accessions,
they were counted as different loci. All confirmed polymorphic
alleles were used for structure analysis.
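
The annealing steps of the touch-down program can be written out per cycle. The sketch below is only an illustration of the schedule as described in the text; the function name and the tuple layout are hypothetical, and it does not model the denaturation or extension steps.

```python
def touchdown_annealing_schedule():
    """Return the annealing temperature (°C) for each PCR cycle in order."""
    # (start temperature, decrease per cycle, number of cycles)
    phases = [
        (68.0, 2.0, 4),   # first touch-down block
        (58.0, 2.0, 4),   # second touch-down block
        (50.0, 0.0, 24),  # constant-temperature block
    ]
    temperatures = []
    for start, step, cycles in phases:
        temperatures.extend(start - step * i for i in range(cycles))
    return temperatures


schedule = touchdown_annealing_schedule()
print(len(schedule), schedule[:8])
```

Listing the temperatures makes it easy to check that the two touch-down blocks end at 62 °C and 52 °C before the 24 cycles at 50 °C.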



Population structure and relative kinship 

Population structure was determined by model-based clustering
using STRUCTURE 2.3.1 (Pritchard et al., 2000). STRUCTURE
was run 10 times for each pre-defined k (the number of population
groups) from 1 to 10 using admixture models, with a burn-in
of 10 000 iterations followed by 10 000 Markov chain Monte Carlo
iterations for each run. The optimal number of groups was reached when k was
set to five. Subsequently, 10 independent runs were performed by
setting k from three to six, with a burn-in of 50 000 and a replication
number of 100 000 for each run. This did not produce different
results from those of the preliminary runs. Among the 10 runs, the
one with the highest likelihood value (k=5) was selected to assign
the proportion of membership for each accession. Relative pairwise
kinship was calculated using SPAGeDi (Hardy and Vekemans, 2002)
with all 109 polymorphic SSR markers. The coefficient of Loiselle
(Loiselle et al., 1995) was used to obtain the pairwise kinship matrix.
The distributions of pairwise kinship for 192 × 192 data points were
summarized using Excel's internal function 'frequency'.
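
For readers working outside a spreadsheet, the binning step can be approximated as follows. This is a minimal sketch that mimics the upper-inclusive bins of a spreadsheet frequency function; the function name, bin edges, and kinship values are hypothetical and are not data from this study.

```python
from bisect import bisect_left


def frequency(values, bin_edges):
    """Count values per bin; bin i holds edge[i-1] < v <= edge[i].

    The final count holds values above the last edge.
    """
    edges = sorted(bin_edges)
    counts = [0] * (len(edges) + 1)
    for v in values:
        # Index of the first edge that is >= v (upper-inclusive bins).
        counts[bisect_left(edges, v)] += 1
    return counts


# Hypothetical pairwise kinship estimates, for illustration only.
kinship = [0.0, 0.0, 0.0, 0.01, 0.03, 0.05, 0.08, 0.2]
print(frequency(kinship, [0.0, 0.05]))
```

With edges at 0 and 0.05, the three counts correspond to estimates of zero (or below), estimates between zero and 0.05, and estimates above 0.05.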



Candidate gene selection, sequencing, and SNP identification 

Fourteen candidate genes were selected for sequencing, including 10
putative genes encoding antioxidant enzymes and additional putative
genes of LEA, PIP, TIP, and MAPK (encoding mitogen-activated
protein kinase) (Supplementary Table S5 at JXB online). Previous
results in perennial ryegrass showed that genes involved in detoxification
play a role in drought tolerance (Liu and Jiang, 2010); therefore,
more genes were selected from the antioxidative pathways. Primers
were designed based on various sources, including GenBank accession
AJ634002 for CAT encoding catalase, EU054285-EU054295
for MAPK, AB533342.1 for LEA3, FJ472349.1 for TIP1, the perennial
ryegrass expressed sequence tag (EST) for PIP1, and the
conserved coding region of orthologous genes from closely related
species such as rice, Brachypodium distachyon, sorghum (Sorghum
bicolor L.), and maize for other genes used in this study. Primers for
amplification of PCR fragments for sequencing of the 14 candidate
genes are listed in Supplementary Table S5.
Both genomic DNA and reverse-transcribed cDNA were used as
PCR amplification templates for synthesis and sequencing of these
genes. Direct genomic DNA sequencing often resulted in unclean
sequence readings because of the highly outcrossing and heterozygous
nature of perennial ryegrass; thus, the more problematic introns
were excluded from sequencing in this study as follows.
First, the exon distribution of all these candidate genes was plotted
in the NCBI Splign service (Kapustin et al., 2008) by using the orthologous
genes from rice and Arabidopsis. Secondly, each of the two
candidate genes, putative CAT and MnSOD, had one long exon that
covered the major part of that gene; therefore, the primers for these
two genes were designed based on a single exon, and genomic DNA
was used as PCR template. The other 12 genes were usually formed by
short exons; thus, cDNA reverse-transcribed from mRNA was used to
design the primers.

Total RNA was isolated from leaf tissues using TRIzol® Reagent
(Invitrogen, Carlsbad, CA, USA) and was treated with DNase
(TURBO DNA-Free Kit; Ambion, Austin, TX, USA) to remove
contaminating genomic DNA. The first-strand cDNA was synthesized
from 2 µg of DNase-treated RNA using an iScript cDNA
synthesis kit (Bio-Rad, Hercules, CA, USA). PCR products with
the correct length were confirmed using agarose gel and visualized
in a gel image system (UVP, Upland, CA, USA). The target fragments
were cut and recovered using a 96-well high-throughput gel
recovery kit (Zymo Research, Irvine, CA, USA). Sequencing reactions
were carried out by using Big Dye Terminator kit version 3.1
(Applied Biosystems Inc.). Sequencing was conducted using an
ABI 3730 genetic analyzer according to the manufacturer's instructions
(Applied Biosystems Inc.) in the Genomic Center at Purdue
University.

Diploid SNPs were identified using the NovoSNP program 3.0.1 
Microsoft Windows Platform version (Weckx et al., 2005). This 
software can deal with heterozygous sequence data. Multiple align- 
ments were performed automatically, and SNP position and the het- 
erozygous allele were detected using the software. Rare SNPs were 
excluded for SNP counting when the total non-major allele count 
was <5% (~8 accessions in this population). After manual verifica- 
tion guided by quality score, diploid SNP information with physical 
positions was obtained. 
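
The rare-SNP exclusion can be sketched in Python as follows. This is an illustration only: it assumes genotypes are stored as two-letter strings (for example "AG" for a heterozygote) and that the 5% cut-off is applied to allele copies; the function names are hypothetical and NovoSNP's own handling may differ.

```python
from collections import Counter


def non_major_allele_fraction(genotypes):
    """Fraction of allele copies that are not the most common allele."""
    counts = Counter()
    for genotype in genotypes:
        if genotype is None:      # skip missing calls
            continue
        counts.update(genotype)   # "AG" adds one A and one G
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return (total - max(counts.values())) / total


def keep_snp(genotypes, threshold=0.05):
    """Keep a SNP only if the non-major allele fraction reaches the threshold."""
    return non_major_allele_fraction(genotypes) >= threshold


# Hypothetical SNPs scored in 100 accessions.
common = ["AA"] * 90 + ["AG"] * 8 + ["GG"] * 2
rare = ["AA"] * 97 + ["AG"] * 3
print(keep_snp(common), keep_snp(rare))
```

Counting allele copies rather than accessions means a heterozygote contributes one copy of each allele, which suits an outcrossing, highly heterozygous species.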

Linkage disequilibrium and association mapping analysis 

The LD was calculated for each candidate gene using TASSEL 2.1
(Bradbury et al., 2007). The generated value was plotted against
the physical distance between each pair of SNPs. Meanwhile, data
from all candidate genes were pooled together to generate the overall
LD for perennial ryegrass. Candidate gene association analysis was
performed on the 179 perennial ryegrass accessions using the general
linear model (GLM), implementing both the Q model and the simple linear
model, with the TASSEL 2.1 software package (Bradbury et al., 2007).
Each SNP was fitted as a fixed effect one at a time to test the association
between the SNP and phenotype. SNPs with a minor allele frequency (MAF)
<5% were filtered out prior to association analysis. After this filtering, SNPs that passed
the threshold of P-value <10⁻³ were deemed significant according
to the procedure of Sukumaran et al. (2012) with modifications. Briefly,
the threshold P-value was determined jointly by considering that
these are SNPs from candidate genes and their numbers are not very
large, and by considering the pattern of the quantile-quantile (Q-Q)
plot of the selected model and the point at which the observed F-test
statistics deviated from the expected F-test statistics (Supplementary
Fig. S1 at JXB online).
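
To make the LD statistic concrete, the sketch below computes a genotype-based r² between two SNPs as the squared correlation of allele dosages, a common approximation when gametic phase is unknown in heterozygous diploids. It is not the TASSEL implementation; the function names and example genotypes are hypothetical.

```python
def dosage(genotype: str, ref: str) -> int:
    """Number of copies (0, 1, or 2) of the reference allele in a genotype."""
    return sum(1 for allele in genotype if allele == ref)


def genotype_r2(x, y) -> float:
    """Squared Pearson correlation between two allele-dosage vectors."""
    if len(x) != len(y) or not x:
        raise ValueError("dosage vectors must be non-empty and equal in length")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        return 0.0  # a monomorphic site carries no LD information
    return (sxy * sxy) / (sxx * syy)


# Hypothetical genotypes at two SNPs in six accessions.
snp1 = [dosage(g, "A") for g in ["GG", "AG", "AA", "GG", "AG", "AA"]]
snp2 = [dosage(g, "T") for g in ["TT", "CT", "CC", "TT", "CT", "CC"]]
print(genotype_r2(snp1, snp2))
```

Plotting such pairwise values against the physical distance between SNPs gives the kind of LD decay curve described in the text.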

In silico sequence analysis for functional substitution(s) 

The sequence variations of putative LpCyt Cu-ZnSOD, LpFeSOD, 
and LpLEA3 showing significant association with traits were further 
analysed to identify putative functional amino acid substitution(s). 
DNA sequences were aligned using the ClustalW2 online service 
(Larkin et al., 2007). The amino acid sequence of each peptide was
directly inferred in NCBI using BlastX and matched with
orthologous proteins in other plant species. The predicted secondary
structures for the proteins were obtained using PredictProtein
(Rost et al., 2004) and Jpred (Cole et al., 2008). The phylogenetic
tree was constructed in ClustalW2-Phylogeny
(http://www.ebi.ac.uk/Tools/phylogeny/clustalw2_phylogeny/) by alignment of the
sequences of plant species including perennial ryegrass, Italian
ryegrass (Lolium multiflorum Lam.), maize, rice, Arabidopsis, sorghum,
wheat (Triticum aestivum L.), barley (Hordeum vulgare L.),
and cotton (Gossypium arboreum L.), and incorporating into the
Neighbor-Joining method with bootstrap analysis of 1000 replicates.
The calculated tree file was viewed in TreeView version 1.6.6
software (http://taxonomy.zoology.gla.ac.uk/rod/treeview.html).

Results 

Variations in drought response 

Drought stress caused leaf wilting, increased CAD, and
reduced LWC, but did not affect Fv/Fm in the panel of perennial
ryegrass accessions (Supplementary Table S6 at JXB
online). Accessions differed significantly in CAD, Fv/Fm,
LWC, and leaf wilting (Table 1; Supplementary Table S1,
Supplementary Fig. S2). Among all accessions across six
environments, CAD ranged from -2.67 °C to 2.08 °C, Fv/Fm
ranged from 0.75 to 0.83, and LWC ranged from 2.76 g g⁻¹
to 6.17 g g⁻¹ under well-watered conditions (Table 1;
Supplementary Fig. S2). Under drought conditions, the
minimum and maximum values were 0.53 °C and 6.53 °C for
CAD, 0.71 and 0.84 for Fv/Fm, 1.80 g g⁻¹ and 3.93 g g⁻¹
for LWC, and 2.7 and 8.0 for leaf wilting, respectively (Table 1;
Supplementary Table S1). For each of CAD, Fv/Fm, and LWC, values
under well-watered conditions were significantly correlated with those
under drought conditions (Table 2; Supplementary Fig.
S3). Under drought stress, Fv/Fm was negatively correlated
with CAD and leaf wilting; LWC was negatively correlated
with leaf wilting, whereas CAD was positively correlated
with leaf wilting. Under well-watered conditions, such correlations
were not observed (Table 2). Twenty-six accessions
were endophyte infected. Analysis of similarities indicated no
significant differences in the overall drought tolerance of the
perennial ryegrass accessions based on endophyte infection
(Global R=215, P=0.216, 999 permutations). The lack of any
significant pattern of drought tolerance among infected and



Table 1. Range of canopy and air temperature difference (CAD),
leaf water content (LWC), chlorophyll fluorescence (Fv/Fm), and
wilting for 179 diverse perennial ryegrass accessions under control
(C) and drought (D) conditions.

            CAD (°C)            Fv/Fm               LWC (g g⁻¹)         Wilting
            C         D         C         D         C         D         D
Minimum     -2.67^a   0.53      0.746     0.707     2.76      1.80      2.7
Maximum     2.08      6.53      0.826     0.836     6.17      3.93      8.0
Mean        -1.08     2.75      0.808     0.798     4.18      2.57      4.6
SD^b        0.92      0.99      0.010     0.016     0.53      0.32      0.94

^a Data summarized across three locations for 2009 and 2010.
^b Standard deviation.

Table 2. Pearson correlation coefficients among canopy and air temperature difference (CAD), chlorophyll fluorescence (Fv/Fm), leaf water content
(LWC), and leaf wilting (wilting) under well-watered conditions (C) and drought stress (D).

Traits      C-CAD     C-Fv/Fm   C-LWC     D-CAD     D-Fv/Fm   D-LWC     D-wilting
C-CAD       1
C-Fv/Fm     0.01      1
C-LWC       0.02      -0.06     1
D-CAD       0.56***   -0.06     -0.14     1
D-Fv/Fm     -0.10     0.50***   0.05      -0.16*    1
D-LWC       -0.06     -0.06     0.34***   -0.02     0.06      1
D-wilting   -0.08     -0.07     0.08      0.18*     -0.15*    -0.15*    1

*, **, and *** indicate significant at P < 0.05, P < 0.01, and P < 0.001, respectively.



uninfected accessions was confirmed visually using a two- 
dimensional rendering resulting from the non-metric multidi- 
mensional scaling analysis (stress=0.1). 

The overall heritability of various traits was high, ranging 
from 0.51 to 0.86 among six environments in 2009 and 2010 
(Supplementary Table S3 at JXB online), indicating high 
repeatability over testing environments; thus least squares
means (lsmeans) of individual traits were calculated across
all environments tested for association analysis. 

Population structure and relative kinship 

A total of 2180 SSR alleles were scored for 109 SSR mark- 
ers across 192 accessions (Supplementary Table S4 at JXB 
online). The average number of alleles per marker, major
allele frequency, polymorphism information content, and
overall mean gene diversity were 20, 0.34, 0.76, and 0.79,
respectively (Supplementary Table S7).

The STRUCTURE analysis identified five groups (G1, G2,
G3, G4, and G5) in this panel of perennial ryegrass (Fig. 1)
based on likelihood plots of the models, stability of grouping
patterns across different runs, and germplasm information
(M. Wang et al., 2009). The mean major allele frequency,
gene diversity, and observed heterozygosity level ranged
from 0.36 to 0.40, 0.71 to 0.76, and 0.42 to 0.53 across the
five groups, respectively (Supplementary Table S7 at JXB
online). G1 was the largest and most diverse group, with 121
accessions of mixed origins, including all the accessions from
Oceania and the majority of the accessions from the USA,
Canada, Europe, and South America. Approximately 89.4%
of cultivated materials were assigned to G1, including 10
out of 11 turfgrass cultivars. Further distinct subgroups were
not detected within G1 after independent structure analysis
(data not shown). G2 contained 21 accessions: 14 from Europe,
six from Asia, and one turf-type commercial cultivar from
the USA. G3 was the second largest group, with 25 accessions,
mainly from Europe (13), Africa (7), and Asia (3). G4
contained 13 accessions, mainly from North Africa (Algeria,
Morocco, and Tunisia). G5 was the smallest, with 12 accessions
mainly from Southern Europe and Asia. There was
no obvious kinship in this population. More than 55% of
the pairwise kinship estimates were zero while ~35% were
between zero and 0.05, indicating that 90% of estimates were
<0.05. The 10% of estimates of >0.05 represented the minimum
relative kinship among samples (Supplementary Fig.
S4) and should not cause further complexity in association
analysis.

Heterozygous sequences and LD decay 

A total of 14 candidate genes involved in antioxidant metabolism,
dehydration, water movement across membranes,
and signal transduction were sequenced to analyse sequence
variation and find SNPs (Table 3). A total of 346 SNPs with
an average of one SNP per 21 bp were identified in 7288 bp of
the expressed regions across the 14 candidate genes. The number
of SNPs per gene ranged from seven in putative LpMAPK
to 50 in LpMDAR, while the average SNP frequency ranged
from one SNP per 11 bp in LpCAT and LpLEA3 to one SNP
per 75 bp in LpMAPK (Table 3). Across the population, heterozygous
genotypes occurred up to 21.4% in total, ranging
from 1.3% of A/T to 7.9% of A/G (Supplementary Table S8
at JXB online). Multiple alleles (e.g. AA, A/T, A/G, TT, or
GG showed up in the same position of different accessions)
were observed in some of the polymorphism positions, suggesting
a rapid sequence evolution in perennial ryegrass, an
outcrossing species.

Combined LD analysis of 13 genes (LpAPX was dropped
because of many uncertain SNPs) showed LD decay close
to 0.1 in <1 kb (Fig. 2), although variations in LD patterns
were noted for each individual gene (Supplementary Fig.
S5 at JXB online). The putative LpCAT, LpChl Cu-ZnSOD
encoding chloroplastic copper-zinc superoxide dismutase,
LpMnSOD, LpFeSOD encoding iron superoxide dismutase,
LpCyt Cu-ZnSOD, LpDHAR encoding dehydroascorbate
reductase, LpMAPK, LpLEA3, LpPIP1 encoding type 1
plasma membrane intrinsic protein, and LpTIP1 encoding
type 1 tonoplast intrinsic protein showed a rapid LD decay
pattern, with values close to or <0.1 within 200-500 bp. LD
of LpGR encoding glutathione reductase dropped to
0.1 within 1 kb. The putative LpMDAR and LpGPX encoding
glutathione peroxidase had relatively higher values from
0.3 to 0.5 at a distance >300-600 bp (Supplementary Fig. S5
at JXB online). Diverse LD decay rates in perennial ryegrass
might indicate a biased evolution of individual genes within
the same species.
Fig. 1. Genetic relatedness of 192 perennial ryegrass accessions with 109 simple sequence repeats (SSRs) as analysed by the
STRUCTURE program. Numbers on the y-axis indicate the membership coefficient. The colour of the bar indicates the five groups
identified through the STRUCTURE program (G1=red, G2=green, G3=blue, G4=yellow, and G5=pink). Accessions with the same colour
belong to the same group.



Association between genes and traits 

Five population groups and minimal relative pairwise
kinship were detected in the perennial ryegrass population
used for this study (Fig. 1; Supplementary Fig. S4 at JXB
online). Therefore, the population structure controlled model
(Q model) was selected as the best fit for association mapping
analysis of genes and traits, which was also confirmed
by quantile-quantile plots of F-test values (Supplementary
Fig. S1). After removal of non-polymorphic and low-quality
SNPs, a total of 346 SNPs were used for association analysis.
By implementing the Q model, three SNPs at loci 134,
220, and 287 from LpCyt Cu-ZnSOD, one at locus 411 from
LpFeSOD, and two at loci 237 and 285 from LpLEA3 were
significantly associated with Fv/Fm, LWC, and LWC under
drought stress, respectively, with a P-value ranging from 
9.02 X 10"^ to 4.94 x 10"* (Table 4). Compared with the simple 
linear model, this was an elimination of ~70% false-positive 
correlations under drought stress (Supplementary Table S9). 
Since locus 77 from LpLEA3 had a significant association 
with D-LWC under a f- value <0.01 level and linkage dise- 
quilibrium with locus 237 and 285 from LpLEA3, this locus 
along with another six significant associated loci were further 
analysed for amino acid substitutions among different hap- 
lotypes. marker values ranged from 0.075 to 0.097 among 
all those associated SNPs. On average, each marker explained 



8.9% (±0.8%) of phenotypic variance for its associated trait 
(Table 4). Four amino acid substitutions were identified from 
significant associated LpLEA3 and LpCyt Cu-ZnSOD, and 
gene-affecting traits were further analysed. In contrast, the 
significant associated SNP from LpFeSOD resulted in syn- 
onymous substitution and no amino acid change was found 
(Supplementary Fig. S6); therefore, it was not included for 
further analysis. 
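
To make the logic of a population structure controlled test concrete, the following sketch fits a trait on structure covariates with and without one SNP and derives an F statistic and a marker R² from the drop in residual sum of squares. It is a minimal illustration under stated assumptions, not the implementation used in this study: the function names are hypothetical, the data are invented, only one Q column is used, and in practice one membership column is dropped because the coefficients sum to one.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        known = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - known) / m[r][r]
    return x


def residual_ss(design, y):
    """Residual sum of squares of an ordinary least-squares fit."""
    p = len(design[0])
    xtx = [[sum(row[i] * row[j] for row in design) for j in range(p)]
           for i in range(p)]
    xty = [sum(row[i] * yi for row, yi in zip(design, y)) for i in range(p)]
    beta = solve_linear_system(xtx, xty)
    return sum((yi - sum(b * x for b, x in zip(beta, row))) ** 2
               for row, yi in zip(design, y))


def q_model_test(trait, snp_dosage, q_matrix):
    """Return (F statistic, marker R2) for one SNP fitted after Q covariates."""
    n = len(trait)
    reduced = [[1.0] + list(q) for q in q_matrix]           # intercept + Q
    full = [row + [float(g)] for row, g in zip(reduced, snp_dosage)]
    sse_reduced = residual_ss(reduced, trait)
    sse_full = residual_ss(full, trait)
    mean_y = sum(trait) / n
    sst = sum((yi - mean_y) ** 2 for yi in trait)
    df_resid = n - len(full[0])
    f_stat = (sse_reduced - sse_full) / (sse_full / df_resid)  # 1 marker df
    r2_marker = (sse_reduced - sse_full) / sst
    return f_stat, r2_marker


# Invented example: 12 accessions, one Q column, one SNP coded 0/1/2.
q1 = [0.9, 0.8, 0.7, 0.1, 0.2, 0.3] * 2
dosage = [0, 1, 2, 0, 1, 2, 2, 1, 0, 2, 1, 0]
noise = [0.1, -0.1, 0.05, -0.05] * 3
trait = [70 + 3 * q + 2 * g + e for q, g, e in zip(q1, dosage, noise)]
f_stat, r2_marker = q_model_test(trait, dosage, [[q] for q in q1])
print(round(f_stat, 1), round(r2_marker, 3))
```

Because the marker is tested only after the structure covariates have been fitted, trait differences that merely track subpopulation membership are not credited to the SNP, which is the rationale for the reduction in false positives reported above.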

Putative LpLEA3 affecting leaf water content 

The LpLEA3 is a putative LEA3 gene in perennial ryegrass,
and an orthologue of wheat cold accumulated gene (WCS19)
(Chauvin et al., 1993) and Italian ryegrass LEA3 (Lcs19)
(Oishi et al., 2010). The total sequenced length of LpLEA3
was ~450 bp, varying by accession. Its nucleotide sequence
had up to 98% identity to Lcs19 and could align well until
the 5' stop codon. Based on the alignment results, the first
48 bp coding sequence of Lcs19 was used to impute the first
48 bp of LpLEA3. A 447 bp full-length LpLEA3 gene was
identified coding a 148 amino acid residue peptide with a
mol. wt of 14.5 kDa. There was a 22-mer signal peptide at
the N-terminus (Supplementary Fig. S7 at JXB online), 18
strongly basic (+) amino acids (K, R), 22 strongly acidic (-)
amino acids (D, E), and 38 alanines (A, 25.7%) in this
predicted protein.
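
The figures quoted for the predicted peptide can be cross-checked with simple arithmetic. The sketch below is only a consistency check on the reported counts; the variable and function names are illustrative, and it assumes that the 447 bp length includes one stop codon and that net charge is approximated as basic minus acidic residues.

```python
def cds_length_bp(n_residues):
    """Coding sequence length in bp, counting one stop codon (assumption)."""
    return 3 * (n_residues + 1)


# Counts as reported in the text for the predicted LpLEA3 peptide.
n_residues = 148
n_basic = 18      # K, R
n_acidic = 22     # D, E
n_alanine = 38

cds_bp = cds_length_bp(n_residues)
net_charge = n_basic - n_acidic
alanine_pct = round(100 * n_alanine / n_residues, 1)

print(cds_bp, net_charge, alanine_pct)
```

Under these assumptions the counts reproduce the reported 447 bp and 25.7% alanine, and the approximate net charge of the whole peptide is negative.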

Table 3. Summary of genes used in this study and the number of single nucleotide polymorphism (SNP) sites in each gene across a
diverse perennial ryegrass population.

Putative gene      Full name                                    L1(b)    L2     L3     Rate(c)   SNP     Freq.(d)
LpCAT(a)           Catalase                                     1478     577    505    93.8      45      11
LpChl Cu-ZnSOD     Chloroplast Cu-Zn superoxide dismutase       621      428    400    98.4      21      19
LpGPX              Glutathione peroxidase                       509      545    446    77.5      15      30
LpMnSOD(a)         Manganese superoxide dismutase               695      289    245    90.6      15      16
LpAPX              Ascorbate peroxidase                         755      585    555    99.5      17(e)   33
LpMDAR             Monodehydroascorbate reductase               1307     930    780    95.6      50      16
LpDHAR             Dehydroascorbate reductase                   641      525    506    98.4      13      39
LpGR               Glutathione reductase                        1490     1130   1087   99.5      44      25
LpFeSOD            Iron superoxide dismutase                    767      550    537    90.7      24      22
LpCyt Cu-ZnSOD     Cytosolic Cu-Zn superoxide dismutase         459      421    355    91.8      17      21
LpMAPK             Mitogen-activated protein kinase             1109     612    522    98.4      7       75
LpLEA3             Late embryogenesis abundant, group 3         446      446    350    90.1      31      11
LpPIP1             Plasma membrane intrinsic protein, type 1    866      544    495    98.9      21      24
LpTIP1             Tonoplast intrinsic protein, type 1          750      55     505    97.8      26      19
Total 14                                                        11 892   8087   7288   94.3      346     21

(a) DNA template used for LpCAT and LpMnSOD; all others used RNA-cDNA template.
(b) L1, gene CDS length; L2, total length covered by the primer(s) designed in this study; L3, final readable sequenced length.
(c) Sequencing rate (%), the proportion of accessions that contain readable sequencing compared with all accessions tested; this was not L3/L2
or L3/L1.
(d) Frequency of SNP, calculated by L3/SNP, stands for the average length (bp) per SNP.
(e) Seventeen SNPs were confirmed, but many uncertain SNPs were found.



Phylogenetic analysis of ~500 LEA genes from different
plant species confirmed that the putative LpLEA3 gene
belonged to subfamily three (Supplementary Fig. S8 at JXB
online) according to the common features of LEA3 that the
gene family was positively charged at the N-terminus and
negatively charged at the C-terminus, and was rich in alanine
(>20%) and in polar residues. In this study, LpLEA3
showed a very similar structure to WCS19 (Chauvin et al.,
1993): the C-terminal half was rich in the acidic amino acids
aspartate and glutamate, thus giving this region a net negative
charge of -10, whereas the N-terminal half was rich in
the basic amino acids lysine and arginine, having a net positive
charge of +6, and rich in alanine (25.7%). A high rate of
amino acid substitutions was found during the evolution of
this gene (Fig. 3). Phylogenetic analysis of LEA3 subfamily
members in the selected species revealed that LpLEA3 was
more closely related to its orthologue in Italian ryegrass, rye
(Secale cereale L.), wheat, barley, and B. distachyon than to rice
and Arabidopsis (Fig. 3).

After comparison of sequences from the majority of the 
exon regions in LpLEA3 (Fig. 4A), D-LWC was found to 
be significantly associated with three SNPs at position 104, 

Fig. 2. Linkage disequilibrium (LD) decay in perennial ryegrass. Plots of squared correlations of allele frequencies (r²) against physical
distance between pairs of SNPs in the 13 pooled genes. The LD pattern of individual genes can be found in Supplementary Fig. S5 at 
JXB online. (This figure is available in colour at JXB online.) 

Table 4. Association of candidate genes with traits of perennial
ryegrass assessed by using a population structure controlled
model (Q model).

Trait      Putative gene     Locus   R²_Marker   P_Marker
D-Fv/Fm    LpCyt Cu-ZnSOD    134     0.0969      9.02E-05
D-Fv/Fm    LpCyt Cu-ZnSOD    220     0.0828      2.48E-04
D-Fv/Fm    LpCyt Cu-ZnSOD    287     0.0752      4.94E-04
D-LWC      LpLEA3            285     0.0905      3.92E-04
D-LWC      LpLEA3            237     0.0914      3.63E-04
D-LWC      LpLEA3            77      0.0625      0.0048(a)
D-LWC      LpFeSOD           411     0.0945      2.63E-04

(a) Locus 77 was included for further analysis due to its significant
association under the P < 0.01 level and linkage disequilibrium with
locus 237 and 285.

D-Fv/Fm, chlorophyll fluorescence under drought conditions; D-LWC,
leaf water content under drought conditions.
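
The average variance explained per marker can be recomputed from the marker R² values in Table 4. The sketch below is a hedged check rather than the original calculation: it assumes that the ± term is a sample standard deviation and that locus 77, which was added at the less stringent P < 0.01 level, is excluded from the average.

```python
import statistics

# Marker R2 values for the six significantly associated loci in Table 4
# (locus 77 of LpLEA3 is left out; see the assumption in the lead-in).
r2_by_locus = {
    ("LpCyt Cu-ZnSOD", 134): 0.0969,
    ("LpCyt Cu-ZnSOD", 220): 0.0828,
    ("LpCyt Cu-ZnSOD", 287): 0.0752,
    ("LpLEA3", 285): 0.0905,
    ("LpLEA3", 237): 0.0914,
    ("LpFeSOD", 411): 0.0945,
}

values = list(r2_by_locus.values())
mean_pct = 100 * statistics.mean(values)
sd_pct = 100 * statistics.stdev(values)   # sample standard deviation

print(f"{mean_pct:.2f}% +/- {sd_pct:.2f}%")
```

Under these assumptions the result agrees, after rounding, with the 8.9% (±0.8%) reported in the text.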



264, and 312 of LpLEA3 (Fig. 4A), corresponding to loci 77, 
237, and 285 in Table 4, respectively. Further analysis of all 
accessions using these three SNPs identified 90 wild-type, 22 
heterozygous mutation (Hz-M), seven homozygous mutation 
(Ho-M), and 38 recombinant genotypes (R1-R4) (Fig. 4A). 
The D-LWC values of Hz-M and Ho-M genotypes were sig- 
nificantly higher than those of the wild genotype accessions, 
which indicated a dominance genetic model of this gene 
(Fig. 4A). Among these three SNPs, at the synonymous sub- 
stitution position 104, only the Hz-M (C:G) genotype had 
significantly higher D-LWC than the wild genotype (C:C), 
but not the Ho-M (G:G) genotype (Fig. 4B). In contrast, 

Fig. 3. Phylogenetic tree of the late embryogenesis abundant
group 3 protein sequence from plant species. NCBI protein
accession numbers were used to indicate homologous proteins
except perennial ryegrass (Lolium perenne). Bar=0.1 amino acid
substitutions per site. AAA16282.2, wheat (Triticum aestivum);
AAM09564.1, rye (Secale cereale); ACJ68104.1, barley (Hordeum
vulgare); BAJ07539.1, Italian ryegrass (Lolium multiflorum);
XP_003581642.1, Brachypodium distachyon; AAS49101.1,
Arabidopsis thaliana; NP_001053831.1, rice (Oryza sativa);
CAK12529.1, Sporobolus stapfianus; NP_001146861.1, maize
(Zea mays); and XP_002447073.1, sorghum (Sorghum bicolor).



Hz-M (A:G) and Ho-M (A:A) types had significantly higher 
D-LWC than the wild type (G:G) at two non-synonymous 
substitution positions 264 and 312 (Fig. 4B). Among these 
three SNP positions, two amino acid substitutions, from gly- 
cine (G) to serine (S) at residue 89 and from alanine (A) to 
threonine (T) at residue 105, were identified (Fig. 4C). Both 
these substitutions were from non-polar residues (G, A) to
polar residues (S, T), which might improve the hydrophilic
feature of this peptide and contribute to water absorption or
conservation and eventually drought tolerance of plants.
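
The correspondence between the SNP positions and the affected residues can be traced with integer arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that positions are counted along the coding sequence with the first base of the start codon as position zero (as stated in the legend of Fig. 4) and that the two numbering schemes differ by a constant offset.

```python
# Offset between the SNP-calling reference and the full-length sequence,
# inferred from the paired positions 77/104, 237/264, and 285/312.
REFERENCE_OFFSET = 104 - 77


def to_full_length(snp_calling_position):
    """Convert a SNP-calling reference position to full-length numbering."""
    return snp_calling_position + REFERENCE_OFFSET


def residue_and_codon_base(position):
    """Map a zero-based coding position to (residue number, base in codon).

    Both returned values are one-based: residue 1 is the start methionine
    and base 1 is the first base of the codon.
    """
    return position // 3 + 1, position % 3 + 1


for locus in (77, 237, 285):
    full = to_full_length(locus)
    print(locus, full, residue_and_codon_base(full))
```

Under these assumptions positions 264 and 312 fall on the first base of codons 89 and 105, matching the two reported substitutions, whereas position 104 falls on the third base of a codon, where a synonymous change is plausible.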

Putative LpCyt Cu-ZnSOD affecting Fv/Fm

A partial coding sequence (356 out of 459 bp) of LpCyt
Cu-ZnSOD was identified in perennial ryegrass. Genotype
A:A in LpCyt Cu-ZnSOD exhibited a significantly higher Fv/Fm
than that of genotype A:T under drought conditions
(Supplementary Fig. S9 at JXB online). A low rate of amino
acid substitutions was found during the evolution of this
gene (Fig. 5). The three SNPs (loci 134, 220, and 287) from
LpCyt Cu-ZnSOD significantly associated with Fv/Fm under
drought stress (D-Fv/Fm) had two amino acid substitutions,
one from alanine (A) to glutamate (E) at residue 40
and the other from E to valine (V) at residue 91 from the
N-terminus (Fig. 6). One of the substitutions was from a
non-polar, neutral residue to a polar, acidic residue, while the
other was from a polar, acidic residue to a non-polar, neutral
residue. Moreover, the two non-polar, neutral residues (A and
V) had very similar structures, which might indicate a
complementary substitution within the same gene. Phylogenetic
analysis showed that LpCyt Cu-ZnSOD in perennial ryegrass
was more closely related to that in barley, wheat, and
B. distachyon than to that in rice, zoysiagrass (Zoysia japonica),
bamboo (Bambusa oldhamii), tobacco, cotton, Arabidopsis,
soybean (Glycine max L.), and populus (Populus suaveolens)
(Fig. 5).
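
The polarity changes described for the four substitutions in LpLEA3 and LpCyt Cu-ZnSOD can be tabulated with a small lookup. This is a minimal sketch: the three-class scheme and the names are assumptions chosen for illustration, and only the six residues involved are listed.

```python
# Simplified side-chain classes for the residues involved (assumed scheme).
RESIDUE_CLASS = {
    "G": "non-polar",
    "A": "non-polar",
    "V": "non-polar",
    "S": "polar uncharged",
    "T": "polar uncharged",
    "E": "acidic",
}

# (gene, residue number, wild-type residue, variant residue) from the text.
SUBSTITUTIONS = [
    ("LpLEA3", 89, "G", "S"),
    ("LpLEA3", 105, "A", "T"),
    ("LpCyt Cu-ZnSOD", 40, "A", "E"),
    ("LpCyt Cu-ZnSOD", 91, "E", "V"),
]


def gains_polarity(wild, variant):
    """True when a non-polar residue is replaced by a polar or charged one."""
    return (RESIDUE_CLASS[wild] == "non-polar"
            and RESIDUE_CLASS[variant] != "non-polar")


for gene, residue, wild, variant in SUBSTITUTIONS:
    print(gene, f"{wild}{residue}{variant}",
          RESIDUE_CLASS[wild], "->", RESIDUE_CLASS[variant],
          "gains polarity" if gains_polarity(wild, variant) else "")
```

In this scheme both LpLEA3 substitutions gain polarity, whereas the two LpCyt Cu-ZnSOD substitutions act in opposite directions.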

Discussion 

Drought responses of perennial ryegrass

Leaf wilting, CAD, Fv/Fm, and LWC provide rapid and easy
measurements for whole-plant responses; thus, they have been
used to screen drought-tolerant plant materials and characterize
drought tolerance at the whole-plant level (O'Neill
et al., 2006; Jansen et al., 2009; Jiang et al., 2009; Luo et al.,
2011). Relatively low LWC, positive CAD values, and high
leaf wilting symptoms potentially indicate the occurrence of 
drought stress. Large variations in these traits across acces- 
sions of perennial ryegrass and further correlation analysis of 
traits suggest that they are adequate parameters for assessing 
drought tolerance in the field. High canopy temperature of 
grass plants may indicate a lower transpiration, and sensitive 
stomatal closure may act to maintain leaf water status and 
prevent leaf wilting in the field. 

Fv/Fm is an indicator of photochemical efficiency in some
species (Baker, 2008), but its value remained unchanged in
other perennial grass species such as Kentucky bluegrass (Poa

[Fig. 4, panel (A): gene structure of LpLEA3 (TSS, ATG, TAA, poly A) and genotypes at the three significant SNP positions.]

D-LWC (%)   Genotype   104 (77)   264 (237)   312 (285)   Accessions
2.496       Wild       C:C        G:G         G:G         90
2.721       Hz-M       C:G        A:G         A:G         22
            Ho-M       G:G        A:A         A:A         7
2.538       R1         C:G        G:G         G:G         13
2.462       R2         G:G        G:G         G:G         12
2.672       R3         G:G        A:G         A:G         8
2.761       R4         G:G        A:A         A:G         5

[Fig. 4, panel (B): D-LWC by genotype at positions 104 (G:G, C:G, C:C), 264 (A:A, A:G, G:G), and 312 (A:A, A:G, G:G). Panel (C): alignment of the 148-residue LpLEA3 peptide for wild type, Hz-M, and Ho-M, showing G to S at residue 89 and A to T at residue 105.]

Fig. 4. Variant alleles and association mapping at LpLEA3. (A) Gene structure and genotypes analysis of LpLEA3: wild type,
heterozygous mutation (Hz-M), homozygous mutation (Ho-M), and recombinants one to four (R1-R4) genotypes were identified on the
basis of LpLEA3 sequence from 158 perennial ryegrass accessions. The positions of these three significant SNPs were shown using
the full-length LpLEA3 gene sequence as reference, with the start codon designated as position zero. The SNP positions at 104, 264,
and 312 were located according to the reference mentioned above, whereas 77, 237, and 285 (in parentheses) were SNP positions
relative to the SNP calling reference (accession 120 sequence). The proposed wild-type genotype was highlighted in green, Hz-M in red,
Ho-M in yellow, and R1-R4 in black, respectively. Values of leaf water content under drought conditions (D-LWC) with the same letter
were not significantly different at P < 0.05 for the wild type, Hz-M, and Ho-M. TSS, transcription start site; arrowed band, untranslated
region; thick bar, downstream region after the stop codon; blank box, exon; thin line, intron; ATG, start codon; TAA, stop codon; blue
shaded area, region sequenced in this study. (B) Leaf water content under drought stress (D-LWC) against its significant associated SNP
markers. Values beside dashed blue lines indicate P-values of association mapping results. The wild-type genotype was highlighted in
green, Hz-M in red, and Ho-M in yellow, respectively. Values of D-LWC with the same letter were not significantly different at P < 0.05.
(C) Amino acid substitutions in LpLEA3 among wild type, Hz-M, and Ho-M. Underlining with 21 asterisks indicates its signal peptide.

Fig. 5. Phylogenetic tree of cytosolic copper-zinc superoxide
dismutase sequence from plant species. NCBI protein accession
numbers were used to indicate homologous protein except
perennial ryegrass (Lolium perenne). Bar=0.1 amino acid
substitutions per site. ABF48717.1, populus (Populus suaveolens);
NP_001077494.1, Arabidopsis thaliana; NP_001235298.1,
soybean (Glycine max); ACI46676.1, cotton (Gossypium
arboreum); CAA39444.1, tobacco (Nicotiana plumbaginifolia);
AEV43309.1, zoysiagrass (Zoysia japonica); NP_001105704.1,
maize (Zea mays); NP_001060564.1, rice (Oryza sativa);
ACX94084.1, bamboo (Bambusa oldhamii); XP_003562484.1,
Brachypodium distachyon; BAJ94548.1, barley (Hordeum
vulgare); and ACO90194.1, wheat (Triticum aestivum).

pratensis L.) and prairie junegrass (Koeleria macrantha) under
drought stress even when LWC decreased to 68-70% (Bian and
Jiang, 2009; Jiang et al., 2010). The present results indicated
that overall Fv/Fm was unaffected by drought stress in the field
across locations but was significantly different among accessions
(Supplementary Tables S1, S6 at JXB online). In addition,
Fv/Fm was also correlated with CAD and leaf wilting of
perennial ryegrass under drought stress, indicating its potential
use for screening drought tolerance of plant populations.
Further reductions in Fv/Fm were associated with increasing
intensity of drought stress in different plant species (Zulini
et al., 2007; Luo et al., 2011).

Collectively, diverse phenotypic responses of perennial 
ryegrass accessions to drought stress are crucial for gene- 
trait association. In this study, leaf wilting and other drought 
response traits were studied in perennial ryegrasses exposed 
to 7-10 d of drought stress; however, stress could last longer 
than this duration in some areas where perennial ryegrass is
grown, which may further affect phenotypic trait variation 
in germplasm. The potential variability in short-term versus 
long-term drought response should be taken into considera- 
tion when developing perennial grasses for improved drought 
tolerance. 

SNP frequency and LD decay 

The average SNP frequency in the 14 candidate genes was one
SNP every 21 bp across 192 accessions of perennial ryegrass.
This was similar to the average frequency of one SNP per
24-33 bp for 11 disease resistance candidate genes in L. perenne
(Xing et al., 2007), but higher than one SNP every 94 bp
in five genes for shoot morphology in perennial ryegrass
(Brazauskas et al., 2011) and one SNP per 127 bp in nine
genes for flowering time in 20 genotypes of perennial ryegrass
(Fiil et al., 2011). The variation in SNP frequencies detected
in different studies may be attributed to the population size
and nature of individual genes in a particular plant species.
LD is a measure of non-random association between alleles
at different loci. The presence of LD is a prerequisite for
association mapping of genes related to phenotypes. Currently,
the threshold to indicate LD decay is r² < 0.1, r² < 0.2, or r²
reduced to half of its initial value. Generally, LD decays more
rapidly in outcrossing species than selfing species (Nordborg
et al., 2002; Kim et al., 2007; Brazauskas et al., 2011; Li et al.,
2011). A rapid LD decay indicates that more recombination
exists within a short distance and that more markers are
required to capture the high frequency of recombination (Zhu
et al., 2008). In selfing species, LD decayed within 10-50 kb
on average in Arabidopsis (Nordborg et al., 2002; Aranzana



WT   1  FASSEGVKGTIFFTQEGDGPTTVTGSVSGLKAGLHGFHVHALGDTTNGCMSTGPHFNPAG  60
        FASSEGVKGTIFFTQEGDGPTTVTGSVSGLK GLHGFHVHALGDTTNGCMSTGPHFNPAG
DS   1  FASSEGVKGTIFFTQEGDGPTTVTGSVSGLKEGLHGFHVHALGDTTNGCMSTGPHFNPAG  60

WT  61  HVHGAPEDEIRHAGDLGNVTAGEDGVAPINFVDKHIPLTGPHSIIGRAVVVHGDADDLGK 120
        HVHGAPEDEIRHAGDLGNVTAG DGVAPINFVDKHIPLTGPHSIIGRAVVVHGDADDLGK
DS  61  HVHGAPEDEIRHAGDLGNVTAGVDGVAPINFVDKHIPLTGPHSIIGRAVVVHGDADDLGK 120

WT 121  GGHELSR 127
        GGHELSR
DS 121  GGHELSR 127

Fig. 6. Non-synonymous SNPs resulted in amino acid substitutions for putative LpCyt Cu-ZnSOD in perennial ryegrass. WT, wild type;
DS, double substitutions. The first amino acid residue substitution from A to E occurred at position 32 (position 40 in reference to Cyt
Cu-ZnSOD peptide of other species), which corresponded to an SNP at nucleotide position 287 in reference accession 92; the second
amino acid residue substitution from E to V occurred at position 83 (position 91 in reference to Cyt Cu-ZnSOD peptide of other species),
which corresponded to an SNP at nucleotide position 134 in reference accession 92. This mutation A (E) reversed another mutation from G
(A) to C (E) within the same gene. (This figure is available in colour at JXB online.)



et al., 2005; Kim et al., 2007) and 100-200 kb in rice (Mather
et al., 2007; McNally et al., 2009; Huang et al., 2010). A rapid
decay within a distance of 500-1500 bp was found in several
outcrossing species such as maize (Remington et al.,
2001; Tenaillon et al., 2001), rye (Li et al., 2011), sunflower
(Helianthus annuus L.) (Liu and Burke, 2006), Douglas fir
(Pseudotsuga menziesii) (Krutovsky and Neale, 2005), and
perennial ryegrass (Xing et al., 2007; Brazauskas et al., 2011).
Previous studies in perennial ryegrass found that LD decay
was at 300 bp to 2 kb, depending on population size and genes
tested (Skot et al., 2007; Xing et al., 2007; Brazauskas et al.,
2011; Fiil et al., 2011). The overall LD decay of all genes to an
r² close to 0.1 within ~1 kb found in this study was consistent
with previous studies on candidate genes related to shoot
morphology (Brazauskas et al., 2011) and flowering time in perennial
ryegrass (Fiil et al., 2011). However, different candidate genes
differed in rates of LD decay (relatively rapid or slow) across
this diverse population of perennial ryegrass, which may
indicate a non-uniformity of plant genome evolution.
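
The SNP frequencies discussed in this section follow directly from Table 3, where frequency is defined as the readable sequenced length (L3) divided by the number of SNPs. The short sketch below recomputes them; the dictionary layout and function name are illustrative, and the count of 17 for LpAPX is taken from the table footnote on confirmed SNPs.

```python
# (L3 in bp, number of SNPs) per candidate gene, transcribed from Table 3.
TABLE_3 = {
    "LpCAT": (505, 45),
    "LpChl Cu-ZnSOD": (400, 21),
    "LpGPX": (446, 15),
    "LpMnSOD": (245, 15),
    "LpAPX": (555, 17),
    "LpMDAR": (780, 50),
    "LpDHAR": (506, 13),
    "LpGR": (1087, 44),
    "LpFeSOD": (537, 24),
    "LpCyt Cu-ZnSOD": (355, 17),
    "LpMAPK": (522, 7),
    "LpLEA3": (350, 31),
    "LpPIP1": (495, 21),
    "LpTIP1": (505, 26),
}


def bp_per_snp(length_bp, n_snps):
    """Average number of sequenced bp per SNP, rounded to the nearest bp."""
    return round(length_bp / n_snps)


total_bp = sum(length for length, _ in TABLE_3.values())
total_snps = sum(snps for _, snps in TABLE_3.values())
overall = bp_per_snp(total_bp, total_snps)

for gene, (length, snps) in TABLE_3.items():
    print(f"{gene}: one SNP per {bp_per_snp(length, snps)} bp")
print(f"Overall: one SNP per {overall} bp")
```

The totals recover the 7288 bp, 346 SNPs, and one SNP per 21 bp reported for the 14 genes, and the per-gene values range from one SNP per 11 bp (LpCAT, LpLEA3) to one per 75 bp (LpMAPK).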

Features of putative LpLEA3 and LpCyt Cu-ZnSOD 

LEA proteins are extremely hydrophilic proteins that were
originally discovered in cotton during seed germination
(Dure et al., 1981). Their high content of water-interacting
residues facilitates the scavenging of water molecules when
a cell experiences changes in water status (Colmenero-Flores
et al., 1999). They are thought to be associated with
dehydration, cold and salt tolerance (NDong et al., 2002; George
et al., 2009; Olvera-Carrillo et al., 2010), acting as protectors
of macromolecules or cellular structures and preventing
protein aggregation during water deficit (Bray, 1997). However,
their physiological and molecular functions still remain
largely unknown.

LEA proteins have been classified into between five and
nine subfamilies on the basis of sequence similarity
(Bies-Etheve et al., 2008) or eight subfamilies as defined by the Pfam
database (Punta et al., 2012), the conserved domain, and the
InterPro databases (Hunault and Jaspard, 2010). There were
only two three-dimensional structures available for this entire
LEA gene family (Tolleter et al., 2007), making it challenging
to investigate its conserved domain and function-structure
relationships. In Arabidopsis, 51 LEA proteins were classified
into nine distinct groups with different amino acid lengths
(Hundertmark and Hincha, 2008). Although genes within
the LEA3 subfamily shared higher similarity than LEA genes
from other LEA subfamilies, the diversity was still large
within this subfamily, with the length ranging from 147 to
383 amino acids. LEA3 genes have perfect 11-mer tandem
repeats in some cereal crops (Dure, 1993) including sorghum
(GQ494000.1), maize (NM_001111828.1), and B. distachyon
(XP_003580519.1). Some putative LEA3 proteins such as
WCOR14 (ACV60217.1) in common beet (Beta vulgaris L.)
have intermediate 11-mer repeats, while others such as Lcs19
in Italian ryegrass (Oishi et al., 2010) have imperfect 11-mer
repeats. The results also showed imperfect 11-mer repeats in
LpLEA3. These results indicate a dynamic evolution of the
LEA3 gene family across plant species.

The majority of LEA proteins have been predicted to be
highly hydrophilic and natively unstructured (Hundertmark
and Hincha, 2008), which indicates a potential high hydrophilic
ability with more polar residues sitting on this kind
of unstructured peptide. In this study, both substitutions in
LEA3 protein of perennial ryegrass were from non-polar
residues (G, A) to polar residues (S, T). This might improve the
hydrophilic nature of this unstructured peptide, thus enhancing
cellular water retention and benefiting drought tolerance.
Since the populations of perennial ryegrass were diverse with
historical recombination events, the substitutions may be one
of the evolutionary patterns of the LpLEA3 gene in perennial
ryegrass. Furthermore, these amino acid substitutions
were in line with SNPs at positions 237 and 285 in LpLEA3,
which were significantly associated with D-LWC. These
results indicate that LpLEA3 may facilitate drought tolerance
by influencing the water status of plants. LpLEA3 was
a homologue of Lcs19, a cold-stimulated gene identified in
Italian ryegrass. The distribution of Lcs19 homologues has
been found among temperate forage grasses but not in tropical
grass species (Oishi et al., 2010). It suggests that LpLEA3
may be a suitable gene for studying the drought response of
temperate grasses with more complex genomes.

SODs play a role in protecting against superoxide toxicity
(Bowler et al., 1992). The primary sequences of
FeSOD and MnSOD apoproteins were related, whereas
that of Cu-ZnSOD was distinct (Kliebenstein et al., 1998).
Cu-ZnSODs are largely located in the cytoplasm, sharing
80-90% similarity at the amino acid level across species
(Crapo et al., 1992; Alscher et al., 2002). The putative LpCyt
Cu-ZnSOD in perennial ryegrass had two amino acid
substitutions in line with two SNPs associated with Fv/Fm under
drought stress; however, the amino acids changed from
A (non-polar, non-charged) to E (polar, negatively charged)
and from E to V (non-polar, non-charged) with a very similar
molecular structure between A and V (Fig. 6). These
results suggest that these amino acid substitutions of LpCyt
Cu-ZnSOD might not be functionally associated with Fv/Fm
of perennial ryegrass. The Blastx result indicated that this
gene had up to 97% sequence similarity compared with 50
other plant species (data not shown). For example, perennial
ryegrass and Arabidopsis shared 85% sequence similarity,
with the same peptide residue length of 152 amino
acids (Supplementary Fig. S10 at JXB online), indicating a
highly conserved feature of Cyt Cu-ZnSOD across different
species.

Allelic effects on complex traits 

Significant allelic effects (8-10%) were found for stress
tolerance in association results within perennial ryegrass. The
difference in D-LWC between genotypes A:A and G:G in
LpLEA3 was 6.8% (Fig. 4B). Similarly, genotype A:A and
A:G in LpCyt Cu-ZnSOD had a 2% difference in D-Fv/Fm
(Supplementary Fig. S9 at JXB online). In maize, SNPs from
lg1 and lg2 explained the leaf angle difference by -0.8° and
-1.13° (Tian et al., 2011). In their study, a total of 203, 287,
and 295 SNPs were significantly associated with upper leaf
angle, leaf length, and leaf width, respectively, demonstrating
that the genetic architecture of the leaf traits was dominated
by small allele effects. These allelic effects indicate that
association mapping analysis of complex quantitative traits
in outcrossing species could be challenging and that larger
populations will be needed to uncover alleles with smaller
effects. Nevertheless, the advantageous alleles from LpLEA3
and LpCyt Cu-ZnSOD discovered in this study could be
considered as target candidates for gene transformation and for
further development of markers for breeding programmes
to improve drought tolerance of perennial ryegrass and
other polyploid perennial grasses. Introgression mapping to
introduce genes or chromosome segments from polyploid to
diploid grasses may also be effective for improving drought
tolerance of forage and turfgrass (Humphreys et al., 2003,
2005). It is expected that the genome sequence of perennial
ryegrass will become available in the future, which will
undoubtedly impact biology research, gene identification,
and cultivar development of perennial ryegrass as well as
other grasses with more complex genomes.

Conclusions 

Five subpopulations have been identified in 192 global perennial
ryegrass accessions. After controlling population structure,
six associations were identified between candidate genes
involved in antioxidant metabolism and dehydration and
traits of Fv/Fm and LWC under drought stress. Two SNPs in
the putative LpLEA3 and one SNP in LpFeSOD were significantly
associated with variation in LWC and three SNPs in
putative LpCyt Cu-ZnSOD were significantly associated with
variations in Fv/Fm under drought stress. Molecular
characterization of these significantly associated SNPs in LpLEA3
of perennial ryegrass identified two amino acid substitutions,
both from a non-polar residue to a polar residue, that may
enhance the hydrophilicity of this peptide and contribute to
variation in leaf water content under drought stress. The study
provided valuable information for candidate gene-based
association mapping of drought tolerance in highly heterozygous
perennial grasses.



Supplementary data 

Supplementary data are available at JXB online. 

Figure S1. Quantile-quantile plots of F-test value for
model comparisons.

Figure S2. Distribution of canopy and air temperature
difference (CAD), chlorophyll fluorescence (Fv/Fm), leaf water
content (LWC), and leaf wilting under well-watered (C) and
drought stress (D) conditions, respectively. The values were
least square means calculated for all accessions across six
environments.

Figure S3. Two-dimensional plots of significant correlations
(P < 0.05 level) among canopy and air temperature
difference (CAD), chlorophyll fluorescence (Fv/Fm), leaf water
content (LWC), and leaf wilting under well-watered (C) and
drought stress (D) of perennial ryegrass.

Figure S4. Distribution of pairwise relative kinship
estimates across 192 accessions of perennial ryegrass.

Figure S5. LD decay patterns of individual candidate
genes of LpCAT, LpChl Cu-ZnSOD, LpGPX, LpMnSOD,
LpMDAR, LpDHAR, LpGR, LpFeSOD, LpCyt Cu-ZnSOD,
LpMAPK, LpLEA3, LpPIP1, and LpTIP1 related to plant
drought response.

Figure S6. Synonymous SNPs did not cause a change in
amino acid sequence for putative LpFeSOD in perennial
ryegrass.

Figure S7. Cleavage site prediction of putative LpLEA3
gene in perennial ryegrass.

Figure S8. Phylogenetic tree of ~500 late embryogenesis
abundant group 3 (LEA3) protein sequences from plants.

Figure S9. Relationship between chlorophyll fluorescence
(Fv/Fm) and SNP markers in LpCyt Cu-ZnSOD.

Figure S10. The sequence similarity of Cyt Cu-ZnSOD
between perennial ryegrass and Arabidopsis.

Table S1. Origin, status information, and drought response
traits of diverse perennial ryegrass accessions.

Table S2. Sampling date, soil moisture, and air temperatures
in three locations before and after drought in 2009
and 2010.

Table S3. Repeatability of phenotypic traits of perennial
ryegrasses across six environments under well-watered and
drought conditions.

Table S4. Information on 109 simple sequence repeat (SSR)
primers across seven linkage groups of perennial ryegrass.

Table S5. Primers used for amplification of 14 candidate
genes in perennial ryegrass.

Table S6. Analysis of variance (ANOVA) of phenotypic
traits of perennial ryegrasses across six environments under
well-watered and drought conditions.

Table S7. Summary statistics for all accessions and
individual groups (G1-G5) within the panel of perennial ryegrass
accessions identified by STRUCTURE analysis.

Table S8. Number of genotypes found in 12 candidate
genes, LpChl Cu-ZnSOD, LpGPX, LpMnSOD, LpMDAR,
LpDHAR, LpGR, LpFeSOD, LpCyt Cu-ZnSOD, LpMAPK,
LpLEA3, LpPIP1, and LpTIP1, across 192 accessions of
perennial ryegrass.

Table S9. Number of associations among 14 genes and
traits under drought conditions using a simple linear model
(S) and population structure (Q) controlled model in perennial
ryegrass.